134 research outputs found
The impact of population demography and selection on the genetic architecture of complex traits
Population genetic studies have found evidence for dramatic population growth
in recent human history. It is unclear how this recent population growth,
combined with the effects of negative natural selection, has affected patterns
of deleterious variation, as well as the number, frequencies, and effect sizes
of mutations that contribute risk to complex traits. Here I use simulations
under population genetic models where a proportion of the heritability of the
trait is accounted for by mutations in a subset of the exome. I show that
recent population growth increases the proportion of nonsynonymous variants
segregating in the population, but does not affect the genetic load relative to
that in a population that did not expand. Under a model where a mutation's
effect on a trait is correlated with its effect on fitness, rare variants
explain a greater portion of the additive genetic variance of the trait in a
population that has recently expanded than in a population that did not
recently expand. Further, when using a single-marker test, for a given
false-positive rate and sample size, recent population growth decreases the
expected number of significant association with the trait relative to the
number detected in a population that did not expand. However, in a model where
there is no correlation between a mutation's effect on fitness and the effect
on the trait, common variants account for much of the additive genetic
variance, regardless of demography. Moreover, here demography does not affect
the number of significant association detected. These finding suggest recent
population history may be an important factor influencing the power of
association tests in accounting for the missing heritability of certain complex
traits
Natural selection reduced diversity on human Y chromosomes
The human Y chromosome exhibits surprisingly low levels of genetic diversity.
This could result from neutral processes if the effective population size of
males is reduced relative to females due to a higher variance in the number of
offspring from males than from females. Alternatively, selection acting on new
mutations, and affecting linked neutral sites, could reduce variability on the
Y chromosome. Here, using genome-wide analyses of X, Y, autosomal and
mitochondrial DNA, in combination with extensive population genetic
simulations, we show that low observed Y chromosome variability is not
consistent with a purely neutral model. Instead, we show that models of
purifying selection are consistent with observed Y diversity. Further, the
number of sites estimated to be under purifying selection greatly exceeds the
number of Y-linked coding sites, suggesting the importance of the highly
repetitive ampliconic regions. While we show that purifying selection removing
deleterious mutations can explain the low diversity on the Y chromosome, we
cannot exclude the possibility that positive selection acting on beneficial
mutations could have also reduced diversity in linked neutral regions, and may
have contributed to lowering human Y chromosome diversity. Because the
functional significance of the ampliconic regions is poorly understood, our
findings should motivate future research in this area.Comment: 43 pages, 11 figure
Recommended from our members
Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples.
The distribution of fitness effects (DFE) has considerable importance in population genetics. To date, estimates of the DFE come from studies using a small number of individuals. Thus, estimates of the proportion of moderately to strongly deleterious new mutations may be unreliable because such variants are unlikely to be segregating in the data. Additionally, the true functional form of the DFE is unknown, and estimates of the DFE differ significantly between studies. Here we present a flexible and computationally tractable method, called Fit∂a∂i, to estimate the DFE of new mutations using the site frequency spectrum from a large number of individuals. We apply our approach to the frequency spectrum of 1300 Europeans from the Exome Sequencing Project ESP6400 data set, 1298 Danes from the LuCamp data set, and 432 Europeans from the 1000 Genomes Project to estimate the DFE of deleterious nonsynonymous mutations. We infer significantly fewer (0.38-0.84 fold) strongly deleterious mutations with selection coefficient |s| > 0.01 and more (1.24-1.43 fold) weakly deleterious mutations with selection coefficient |s| < 0.001 compared to previous estimates. Furthermore, a DFE that is a mixture distribution of a point mass at neutrality plus a gamma distribution fits better than a gamma distribution in two of the three data sets. Our results suggest that nearly neutral forces play a larger role in human evolution than previously thought
Lab Retriever: a software tool for calculating likelihood ratios incorporating a probability of drop-out for forensic DNA profiles.
BackgroundTechnological advances have enabled the analysis of very small amounts of DNA in forensic cases. However, the DNA profiles from such evidence are frequently incomplete and can contain contributions from multiple individuals. The complexity of such samples confounds the assessment of the statistical weight of such evidence. One approach to account for this uncertainty is to use a likelihood ratio framework to compare the probability of the evidence profile under different scenarios. While researchers favor the likelihood ratio framework, few open-source software solutions with a graphical user interface implementing these calculations are available for practicing forensic scientists.ResultsTo address this need, we developed Lab Retriever, an open-source, freely available program that forensic scientists can use to calculate likelihood ratios for complex DNA profiles. Lab Retriever adds a graphical user interface, written primarily in JavaScript, on top of a C++ implementation of the previously published R code of Balding. We redesigned parts of the original Balding algorithm to improve computational speed. In addition to incorporating a probability of allelic drop-out and other critical parameters, Lab Retriever computes likelihood ratios for hypotheses that can include up to four unknown contributors to a mixed sample. These computations are completed nearly instantaneously on a modern PC or Mac computer.ConclusionsLab Retriever provides a practical software solution to forensic scientists who wish to assess the statistical weight of evidence for complex DNA profiles. Executable versions of the program are freely available for Mac OSX and Windows operating systems
Genomic Underpinnings of Population Persistence in Isle Royale Moose
Island ecosystems provide natural laboratories to assess the impacts of isolation on population persistence. However, most studies of persistence have focused on a single species, without comparisons to other organisms they interact with in the ecosystem. The case study of moose and gray wolves on Isle Royale allows for a direct contrast of genetic variation in isolated populations that have experienced dramatically differing population trajectories over the past decade. Whereas the Isle Royale wolf population recently declined nearly to extinction due to severe inbreeding depression, the moose population has thrived and continues to persist, despite having low genetic diversity and being isolated for ∼120 years. Here, we examine the patterns of genomic variation underlying the continued persistence of the Isle Royale moose population. We document high levels of inbreeding in the population, roughly as high as the wolf population at the time of its decline. However, inbreeding in the moose population manifests in the form of intermediate-length runs of homozygosity suggestive of historical inbreeding and purging, contrasting with the long runs of homozygosity observed in the smaller wolf population. Using simulations, we confirm that substantial purging has likely occurred in the moose population. However, we also document notable increases in genetic load, which could eventually threaten population viability over the long term. Overall, our results demonstrate a complex relationship between inbreeding, genetic diversity, and population viability that highlights the use of genomic datasets and computational simulation tools for understanding the factors enabling persistence in isolated populations
Assessing the Evolutionary Impact of Amino Acid Mutations in the Human Genome
Quantifying the distribution of fitness effects among newly arising mutations in the human genome is key to resolving important debates in medical and evolutionary genetics. Here, we present a method for inferring this distribution using Single Nucleotide Polymorphism (SNP) data from a population with non-stationary demographic history (such as that of modern humans). Application of our method to 47,576 coding SNPs found by direct resequencing of 11,404 protein coding-genes in 35 individuals (20 European Americans and 15 African Americans) allows us to assess the relative contribution of demographic and selective effects to patterning amino acid variation in the human genome. We find evidence of an ancient population expansion in the sample with African ancestry and a relatively recent bottleneck in the sample with European ancestry. After accounting for these demographic effects, we find strong evidence for great variability in the selective effects of new amino acid replacing mutations. In both populations, the patterns of variation are consistent with a leptokurtic distribution of selection coefficients (e.g., gamma or log-normal) peaked near neutrality. Specifically, we predict 27–29% of amino acid changing (nonsynonymous) mutations are neutral or nearly neutral (|s|<0.01%), 30–42% are moderately deleterious (0.01%<|s|<1%), and nearly all the remainder are highly deleterious or lethal (|s|>1%). Our results are consistent with 10–20% of amino acid differences between humans and chimpanzees having been fixed by positive selection with the remainder of differences being neutral or nearly neutral. Our analysis also predicts that many of the alleles identified via whole-genome association mapping may be selectively neutral or (formerly) positively selected, implying that deleterious genetic variation affecting disease phenotype may be missed by this widely used approach for mapping genes underlying complex traits
A community-maintained standard library of population genetic models
The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.Open access journalThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
A Simple Genetic Architecture Underlies Morphological Variation in Dogs
The largest genetic study to date of morphology in domestic dogs identifies genes
controlling nearly 100 morphological traits and identifies important trends in
phenotypic variation within this species
- …